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Abstract. Several implications of well-known fluctuation theorems, on the statistical 
properties of the entropy production, are studied using various approaches. We begin 
by deriving a tight lower bound on the variance of the entropy production for a given 
mean of this random variable. It is shown that the Evans-Searles fluctuation theorem 
alone imposes a significant lower bound on the variance only when the mean entropy 
production is very small. It is then nonetheless demonstrated that upon incorporating 
additional information concerning the entropy production, this lower bound can be 
significantly improved, so as to capture extensivity properties. Another important 
aspect of the fluctuation properties of the entropy production is the relationship 
between the mean and the variance, on the one hand, and the probability of the 
event where the entropy production is negative, on the other hand. Accordingly, 
we derive upper and lower bounds on this probability in terms of the mean and 
the variance. These bounds are tighter than previous bounds that can be found 
in the literature. Moreover, they are tight in the sense that there exist probability 
distributions, satisfying the Evans-Searles fluctuation theorem, that achieve them with 
equality. Finally, we present a general method for generating a wide class of inequalities 
that must be satisfied by the entropy production. We use this method to derive several 
new inequalities which go beyond the standard derivation of the second law. 
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1. Introduction 

It has been recently realized that time reversal symmetry implies that some exact results, 
termed fluctuation theorems, hold for systems which are driven arbitrarily far from 
thermal equilibrium pQ. 

As an example, which will be of particular interest in this paper, consider an isolated 
system in a certain equilibrium state, which is driven to another state by varying an 
external control parameter A over a time interval < t < T, according to a certain 
protocol {A(t), < t < T}. Following the application of the protocol, a difference a is 
developed between the entropy at the final state and the entropy at the initial state of 
the system. This difference is also called the entropy production. One can now imagine 
the same system being driven using a time-reversed protocol \(t) = X(T — t). When the 
time-reversed protocol obeys X(t) = \{t), it can be shown that time reversal symmetry 
implies the Evans-Searles fluctuation theorem [2J [3] 



where p(a) is the probability density function of the entropy production a. This 
immediately yields 



where as usual, we assume that the entropy change in the system is much smaller than 
its total entropy. As is well known, the second law of thermodynamics, (a) > 0, is easily 
obtained by applying Jensen's inequality, (e~ CT ) > e~' CT ', to the left-hand side of eq. (J2J). 
Other, directly related examples of fluctuation theorems include the Jarzynski equality, 
the Crooks theorem, and the Gallavotti-Cohen relation [SI El 13 El HOI [HI H21 US] - 

Beyond their elegance and their pure academic value, some of the fluctuation 
theorems have been suggested as possible tools for measuring equilibrium quantities 
from averages over repeated experiments of a non-equilibrium process pQ, which is a 
considerably interesting idea. For instance, the Jarzynski equality paves the way to 
measure free-energy differences from non-equilibrium measurements of work performed 
on (or by) the system. However, as is evident from eq. ([2]), events with a negative 
entropy production play an important role, which makes such a measurement impossible 
for macroscopic systems. 

As mentioned above, when Jensen's inequality is applied to eq. (T5]), the second law 
is immediately obtained. Moreover, since the exponential function is strictly convex, 
Jensen's inequality becomes an equality if and only if a is a degenerate random variable, 
which takes on the value a = with probability one. This corresponds to a perfectly 
reversible process where no entropy is produced. In any other case, where Jensen's 
inequality is a strict one, a becomes a non-degenerate random variable, and hence 
it fluctuates about its mean, which then must be strictly positive. Thus, eq. (J2]) 
already tells us an important qualitative fact: Irreversibility, strictly positive mean 
entropy production, and fluctuations (e.g., in terms of positive variance) appear always 
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together. However, eq. 02]) does not immediately tell us much in the quantitative level. 
In particular, when applying Jensen's inequality, it is felt that a great deal of valuable 
information, concerning the statistics of the entropy production, is lost. 

It is the purpose of this paper to explore some quantitative restrictions on the 
probability distribution of the entropy production that arise from the fluctuation 
theorems. Throughout most of the paper, we consider systems which obey the Evans- 
Searles fluctuation theorem, i.e., systems that satisfy eq. flTJ). Some restrictions are 
rather obvious. Consider for example the mean entropy production 



In the small mean entropy production limit, one can expand eq. (|2j) to leading order 
and obtain the simple relation v = 2m. The same restriction holds, of course, also in 
the Gaussian case (even without taking the small mean entropy limit), as only the first 
two cumulants are non-zero in this case. 

In this paper, we derive the most stringent restrictions, that can possibly be posed 
by the fluctuation theorems, on the shape of the distribution. In particular, we first 
derive a lower bound on the variance v of the entropy production a as a function of 
the mean entropy production m. The bound we derive is tight in the sense that there 
is a probability distribution of the entropy production, which achieves the bound with 
equality. It turns out, from this bound, that the Evans-Searles fluctuation theorem 
alone imposes a meaningful lower bound on the variance v only when the mean entropy 
production m is very small. It is nonetheless demonstrated that upon incorporating 
additional statistical (and/or physical) information concerning the entropy production, 
this lower bound can be significantly improved, so as to capture extensivity properties, 
which mean that in the thermodynamic limit, it is plausible that both m and v should 
scale linearly with time and with the system size, and so, they should be proportional 
to each other. 

Next, we derive upper and lower bounds on the probability of negative entropy 
production, Pr{a < 0}, as a function of the mean entropy production m and the 
variance v. To the best of our knowledge, these bounds are better than related bounds, 
previously derived using Jensen's inequality [H] . They are again tight in the sense that 
there exist probability distributions, satisfying the Evans-Searles fluctuation theorem, 
which achieve them with equality. The interesting fact, in this context, is that we are 
actually obtaining a bound on the large deviations rate function of the probability of the 
rare event {a < 0}. As will be shown later, if a is an extensive random variable, then 
this probability can decay no faster than e~ m , which is exponential in the system size (or 
time). The upper and lower bounds are also universal (in the probability distribution 
of a) and the important fact is that they provide non-trivial assessments on Pr{er < 0}, 
a quantity which is not easily measurable by experiments, in terms of m and v, which 
are measurable in principle. 



m = 




(3) 



and the variance 




(4) 
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Our last result is about a general analysis tool to be applied to eq. ([2]) in order to 
derive a wide class of inequalities that involve the entropy production. These inequalities 
are, in general, more powerful than the standard Jensen inequality, and some of them 
lead to certain variations of the second law, (a) > 0. For example, among other things, 
we prove that the probability of the event {a < a}, for any deterministic parameter a, 
is bounded from above according to 

Pr{a< a} < exp {(a) | CT < a } , (5) 

where the average is conditional on a < a. Thus, the second law is obtained as a special 
case, with a — > oo. Another example of an inequality from this class is 

(ae-°) < 0, (6) 

which tells us that although a is non-negative on the average, as the second law asserts, 
the event of negative entropy production still has enough probabilistic weight so as to 
make the average of a, weighted by the function e~ a (which favors negative values of 
a), negative rather than positive. 

The remaining part of the paper is structured as follows: In Sec. 2, we derive lower 
bounds on the variance of the entropy production given the mean. In Sec. 3, lower 
and upper bounds are derived for the probability of rare events with a negative entropy 
production. In Sec. 4, we present the method for deriving inequalities as mentioned 
above. Finally, we conclude in Sec. 5. 



2. A lower bound on the variance of the entropy production 

Consider a system which obeys eq. ([1]), with (a) = m, where m is given. First, observe 
that since p(—a) = e~ a p(a), we have 

/+oo r+oo 
da-p(a) = dap( ( x)(l + e- CT ) = l, (7) 
-oo JO 

which means that instead of considering a real valued random variable a, taking 
both positive and negative values, and distributed according to p(c), we can consider, 
equivalently, a positive random variable, distributed according to 

q(a)=p(a)(l + e~n, cr>0. (8) 
The mean m can then be expressed, in terms of q, according to 



m 



-oo 

' + 00 



da- 


■ ap(a) 


(9) 


da- 


■p(a)a(l-e-°) 


(10) 


da- 


1 - e~ CT 
• q(a)a ■ 


(11) 


da- 


■ q(a)a tanh 


(12) 


da- 




(13) 
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where we have defined 

/(<r) = ataiih(~). (14) 
Similarly, the second moment is 

"+oo , . - - 

2 



a 2 ) = j da- a 2 p(a)(l + e~ a ) = j da- a 2 q(a). (15) 

In simple words, we have used the Evans-Searles fluctuation theorem in order to 
transform a two-sided random variable a, governed by p(cr), into a one-sided random 
variable (a > 0), whose probability density function is given by q(a) = p(a)(l + e~ a ) and 
we henceforth denote expectations under p and under q, by (-) and (•) , respectively. 
Thus, the constraint (a) p = m, in the domain of the original two-sided random variable, 
is equivalent to the constraint (/(cr)) = m in the domain of the transformed, one-sided 
random variable. 

Since / is a monotonically strictly increasing function for cr > 0, it is clearly 
invertible in this range, and we shall denote the inverse function of / by h. I.e., \x = f(a) 
if and only if a = h(fi). We next observe that h 2 (fi) = [h(fi)] 2 , which is obviously the 
inverse function of f(y/b~), is a convex functionJ|| Thus, we readily obtain the following 
lower bound on the second moment in terms of m: 

V\ = (a 2 \ (16) 



q 

.2/ 



*W))>_ (17) 



> h 2 ((f(cr)) q ) (It 



= h 2 {m), (19) 

where the inequality follows from the application of Jensen's inequality to the convex 
function h 2 . Equality is obtained when a is deterministic under q, i.e., 

q(a)=q*(a) = 5(a-h(m)), (20) 

or equivalently, 

p(*) = p» - • 6 ^ - ^H) + it^m • 6{a + h{m)) - (21) 

Using the above result, we see that the variance of the entropy production can be 
bounded by 

v>h 2 {m)-m 2 . (22) 

This is the central result of this section. The function h 2 (m) — m 2 is depicted in Fig. [TJ 
We comment that the above analysis holds, not only for the second moment. It can 
be generalized straightforwardly from the second moment, (cr 2 ), to every higher moment 
of the form ^|cr| fc ^>, where k is any real number larger than 2 (i.e., k does not have to be 
an integer). The more general result is then 

\a\ k ) > h\m) = [f-\m)] k , (23) 



| This follows from the fact that f{yfcr) is monotonically increasing and concave for cr > 0, as can 
easily be checked from the derivatives of this function. 
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where equality is universally achieved by the same density function p* as before. 

Returning to the second moment, it is easy to see that h(x) > x (with equality only 
at x = 0), so the lower bound on the variance is positive. For small m, h 2 (m) rs 2m, and 
so, to this order, v > 2m, in agreement with the simple relation obtained by expanding 
eq. ([2]). For large values of m, we have to leading order h(m) = m(l + 2e _m//2 ) and 
the bound decays as 4e _m//2 . This behavior is easy to understand since the minimum 
variance distribution, for a given m, is achieved by a pair of delta functions, as seen 
above in eq. ( 12TT) . For large values of m, the contribution of the Dirac function at the 
negative value of a becomes essentially irrelevant to the variance, due to the exponential 
weighting of the Evans-Searles theorem. As advertised in the Introduction, in this 
context, the fluctuation theorems provide useful information on the variance of the 
entropy production distribution only for relatively small values of the mean entropy 
production. 

The reason for this behavior is simple: The Evans-Searles fluctuation theorem 
merely relates the probability density p at negative values of o to those at the 
corresponding positive values, but as m grows without bound (the thermodynamic 
limit), most of the probability mass goes for positive values of a anyway, and so, 
the information concerning negative values becomes essentially irrelevant. Therefore, 
it is understood that the Evans-Searles theorem alone cannot possibly give useful 
information about the thermodynamic limit, and this is not because of a possible 
weakness in the derivation of the lower bound (which is tight, in the sense of being 
achieved by p*). This means that in order to obtain more meaningful bounds, one must 
incorporate additional information. 




0123456789 10 
Meanm 



Figure 1. A plot of the lower bound on the variance, h 2 (m) — m 2 , as a function of m. 
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To demonstrate this fact, suppose that the additional information we have is given 
by the extra constraint 

Pr{M < r} = b, (24) 

where r > and b e [0, 1] are giverj§| and where we will assume that /(r) < m. How does 
the lower bound on the variance v change in the presence of this additional constraint? 

Once again, we refer to the one-sided distribution q. The two constraints now 
read (/(cr)) = m and q{a < r} = b. Next, we note that every probability measure q, 
satisfying the two constraints on the positive reals, can be represented as a mixture of 
two probability density functions, q\ and q 2 , as follows: 

q(a) = b qi (a) + (l-b)q 2 (a), (25) 

where the support of qi is (a subset of) the interval [0, r] and the the support of q 2 
is (a subset of) (r, oo]. We next denote expectations with respect to q\ and q 2 by 
and (-) 2 , respectively. Let us define mi = (/(cr)) 1 and m 2 = (f(o~)) 2 , where, of course, 
brrii + (1 — b)m 2 = m, m% < f(r). Now, from the same considerations as in the proof of 
the first bound derived above, we have 



a 2 



1 > h\ mi ) (26) 
a 2 ) 2 > h\m 2 ). (27) 

_2\ _ u /_2\ , t-i u\ I J1 S 



and 



This implies 

> bh 2 {m x ) + (1 - b)h 2 (m 2 ) 
= 6/i 2 (mi) + 

>bh 2 (f(r)) + 

br 2 + (1 - b)K 2 ( "\"_ J ^ J ) , (28) 

where the second inequality follows from the limitation mi < f(r) and the convexity 
of h 2 , which implies that the minimum of bh 2 ( mi ) + (1 - b)h 2 (^^zf 1 ), subject to 
the constraint mi < f(r), is achieved for mi = f(r) (provided that f(r) < m, as 
assumed). The above lower bound is achieved as both inequalities in the last chain 
become equalities, and this is the case for 

q(a) = b5(a - r) + (1 - b)5 (a - h fej^ ) ) ' (29) 

which means two pairs of Dirac delta functions in the domain of the original, one-sided 
random variable, governed by p. 

§ In practice, this additional information may be a result of an experimental measurement. 
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In view of the foregoing discussion on the first lower bound, it is now interesting to 
examine what happens in the limit of large mean entropy production to. In this regime, 
it makes sense to let r increase with to, so as to keep b approximately constant and thus 
to avoid a situation where the event |er| < r becomes a rare one. In the lower bound, 
when both r and to — bf(r) are large, f(r) ~ r and 

/ m - bf(r) \ _ m - bf(r) _ m-br 

I 1-6 J ~ 1-6 ~ 1-6 lJ 



and so, 



o 2 >>^ + (l-&)(j^) 
6r 2 + m 2 — 2mrb 

= T3j < 31 > 

= m * + (32) 

which means that the asymptotic lower bound on the variance is 

b(r — m) 2 , , 

" a TT- (33) 

This is especially suitable in situations where central limit theorem arguments hold, 
and then, in the vicinity of the peak, the distribution can be approximated by a 
Gaussian with a mean proportional to the variance (both being extensive variables). 
Then b remains approximately constant if the dependence of r upon m is chosen to be 
r = r(m) = m — c^/rn, where c is some arbitrary positive constant. In this case, the 
lower bound fl33l) becomes 



bc 2 m 

v > — b , (34) 

which is indeed extensive (proportional to to), as may be expected. This is very different 
from the earlier lower bound, which vanishes as to tends to infinity. 

3. Bounds on the probability of negative entropy production 

Eq. (JTJ) embeds a certain symmetry property of the entropy production about the origin. 
In this section, we employ this symmetry to derive upper and lower bounds on the 
probability of the (rare) event that the entropy decreases. This serves as another 
measure of fluctuations in the entropy production. Namely, we derive bounds on 
Prjcr < 0} = J ^ oo dap(a), which depend merely on the mean to and the variance 



4T 



For the lower bound, we use the simple inequality 

e~ x > e~ a -e~ a {x-a), (35) 

|| More precisely, our lower bound will depend merely on m, whereas the upper bound will depend on 
both m and v. 
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which follows from the simple fact that the exponential function is lower bounded by 
the afflne function tangential to it at any point (a, e~ a ) on the curve, where a is an 
arbitrary real parameter, whose value is left for our choice. Using this inequality, we 
now have 

Pr{ff < 0} = / da-p(a) 

J —CO 

roo 

= / dae- a ■ p(a) (36) 
Jo 

roo 

> / da[e~ a - e~ a (a - a)} ■ p(a) (37) 
Jo 

= (1 + a)e~ a Pr{a > 0} - e~ a / da ■ ap(a) (38) 

Jo 

rco 

= (1 + a)e- a [l - Pr{a < 0}} - e' a da ■ ap(a), (39) 

Jo 

and so 

Pr{. < 0} > (1 + "- m t )e ' Q = l + a ~ m \ (40) 
L _ J ~ l + (l + a)e- Q e a + a + l v ' 

where we have denoted m + = J °° da ■ ap(a). Since this holds for any real a, then 

Pr{a < 0} > sup (41) 
a e a + a + 1 

The maximum is achieved for a = ip(m + ), where ip(-) is the inverse of the function 
(f)(a) = a/(l + e~ a ). This yields 

e^( m +) + ^(m + ) + 1 v y 

In the the thermodynamic limit, m + w m and ct ~ m + , and so, 

Pr{a < 0} > e- m . (43) 

L ~ 1 ~ e m + m + 1 

We observe then that in the thermodynamic limit, if m is an extensive variable, the 

probability of a negative entropy production cannot decay exponentially faster than 

e~ m . While this lower bound is universal (in the sense of being independent of the 

actual probability distribution of a), it is nevertheless a tight bound in the sense that it 

is achieved by a certain distribution that satisfies the Evans-Searles fluctuation theorem 

(again, given by a pair of Dirac delta functions). 

Next we derive an upper bound on the probability of negative entropy production. 

To this end, we use the following inequality which applies for any x > and a > 0: 

e -x < £ -a _ e -a( x g ( x _ a y ( 44 ) 

where 

cr 

This upper bounds the exponential function e~ x by a quadratic function, tangential to 
the exponential function at the point x — a, where the coefficient g of the quadratic 
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term is chosen so as to keep the quadratic function above the exponential function for 
every positive x. Denoting s + = J °° da ■ a 2 p(a), we have 

POO 

Pr{a < 0} = / dae- a ■ pier) (46) 
Jo 

poo r 1 

< / da e~ a - e~ a (o - a) + g(a 2 - 2aa + a 2 ) • p(a) (47) 
Jo 1 J 

= [(a + l)e~ a + a 2 g}Pi{cr > 0} - e~ a m + + g(s+ - 2am + ) (48) 
= Pr{a > 0} - e' a m + + g(s + - 2am + ) (49) 
= 1 - Pr{(j < 0} - e~ a m+ + g{s + - 2am + ) (50) 

and so, 

Prja < 0} < . (51) 

Since this holds for every a > 0, 

Pr{a < 0} < inf ^ + 9* + - 2agm + - e-m + 

a>0 2 

= \ + \' v ^-l9s+-2agm + - e~ a m + ). (53) 

Z Z a>0 

A minimization over a yields a = s + /m + so that, 

gs + — 2agm + — e~ a m + = gs + — 2gs + — e~ a m + (54) 

= - (gs + + e~ a m + ) (55) 
= - (#s + + m + e^+ /m +) (56) 



where 



Thus, 



m 2 , ( m+ m 2 . 



S 2 ! \ s+ s?_ 



g- s +/ m +_ 



2 

+ m+e- s+/m+ = (l - e - s+/m+ J (57) 

p r { a < o} < I - ^ + !?4 e - + /«H-. (58) 

Similarly as the lower bound, the upper bound too is universal and tight in the sense 
defined above. Here, in the thermodynamic limit, m + — > m and s + — > s = v + m 2 , 
which yields 

Pl{a < 0} < i (l - =!) + £ e -M (59) 

Using the bound obtained in the previous section (see eq. (12"2"|) and the discussion 
following it), one may wonder what is the range of values that this bound can possibly 
take for a given values of m and v. One can check that the minimum of the bound, 



and so, 
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in the large m limit, is given when the variance is minimum, i.e., v = 4e _m//2 , so that 
to leading order, it is 2e~ m l 2 / m 2 . Its maximum value is achieved when v 3> m 2 , and 
it behaves like v/(2s) = l/(2m 2 /v + 2). For comparison, we note that the well-known 
Chebychev inequality (that also bounds the probability of interest in terms of m and v) 
yields 

Pr{a < 0} = Pr{m - a > m} (61) 

< (62, 
m 

= — 2 , (63) 
which is a weaker upper bound, that does not make use of the fluctuation theorem. 



4. Inequalities based on Jensen's inequality with a change of measure 

As mentioned already in the Introduction, it is common practice to go from eq. (J2J) to 
the second law via Jensen's inequality. As was also said in the Introduction, a great 
deal of information is lost by this application of Jensen's inequality, while it is felt that 
eq. (EJ) has much more to tell. 

In this section, we present a method by which a variety of stronger and more 
informative inequalities can be generated from eq. (EJ) by applying Jensen's inequality 
in a somewhat more sophisticated way, which allows a change of the probability measure, 
where the new measure is subjected to optimization. Unlike the usual use of Jensen's 
inequality, here no information is lost at all since eq. ([2]) can be 'recovered' as special 
case. 

First, let us recall the elementary fact that the entropy production a is actually 
a function of the random path (or the trajectory) x = {x t : < t < T} taken 
by the system in phase space as time runs from t — to t — T, where Xt is the 
microscopic state at time t. In other words, a should actually be denoted by a(x). 
For example, in the case of Markovian dynamics of a system controlled by an agent, 
o~(x) = (3Wd(x) = f3[W(x) — AF], is the entropy production due to dissipated work 
Wd(x) at fixed temperature 1//3, where W(x) is the total work and AF is the free- 
energy difference. This is then Jarzynski's equality. 

Let the probability law, that governs the trajectory x, be denoted by P. Let 
Q be an arbitrary alternative probability measure for x. To avoid ambiguities, we 
denote, until further notice, expectations with respect to P and Q by (-) p and (-)q, 
respectively. Consider now the following chain of inequalities, where before applying 
Jensen's inequality, we change the underlying probability measure from P to Q, and 
thereby obtain an expression that depends on Q: 

= J dxP(x)e- a W 

= J dxQ(x)e-^ x)+1 ^ p ^ Q ^ 
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= / e -cr(X)+HP(X)/Q(X)]^ 

> e -{*) Q -D{Q\\P) . ( 64 ) 

Here we have introduced the relative entropy D(Q\\P) = (In [Q(x)/P(x)])q. This is 
equivalent to the inequality 

(*} Q + D(Q\\P)>0 (65) 

which holds true for every probability measure Q. For Q = P, we are, of course, back 
to the second law. However, a stronger inequality is obtained upon minimizing the 
left hand side with respect to Q across some set of probability measures that includes 
Q = P. The global minimum of the left hand side among all probability measures 
(which is attained by Q*(x) = P(x)e~ a ( x )) turns out to be zero, and so, this leads 
to an uninteresting trivial identity. However, if the left hand side is minimized across 
some smaller set of measures (e.g., those that maintain moments of certain functions 
or physical quantities), then one still obtains a non-trivial inequality, which is stronger 
than the ordinary second law. 

Another useful way to present inequality ( 1651) is the following: Consider an arbitrary 
non-negative path function * = ^(x) with (\I/) P G (0, oo) and let us select 

_ PjxMx) _ PjxMx) 
^ ' - fdx'P{x')^{x') ~ (*) p • 1 ' 



Then, obviously, 



<'>«= wf- (67) 



and 



D{Q\\P) = J dx-Q(x)\og 



Q{x) 

P(x) 



, P(x)V(x) *(aj) 
ax ■ — — • log ■ 



1 «*log*) p -(*) p log(*) p ). (6* 



Wp 

On substituting these expressions back into (1651) . we obtain the following inequality, 
which is the main result of this section: 

(*<r) >(#)• log log*), (69) 

where we have omitted the subscript P from the expectation operator since now, all 
expectations are taken again with respect to the original measure P. This extends 
the inequality (a) > (the second law) by correlating cr(x) with an arbitrary random 
variable *(x), measurable on the path. In other words, we now have a family of bounds 
on 'projections' of cr(x) in the 'directions' of all non-negative path functions *(cc), and 
not just in the one direction pertaining to *f>(x) = 1, as in the second law. The right 
hand side of this inequality depends solely on the statistics of *, and it is never positive 
due to the convexity of the function f(t) = tlogt for t > 0. 
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The above lower bound is tight in the sense that there is a choice of ^>(x) for which 
the inequality becomes an equality, and this is ^(x) = e~ a ( x \ Since the left hand side 
of the inequality is (ae~ a ), for this choice of and it is identical to the right hand side 
which is non-positive. It follows that 

[o-e~ a ) < 0. (70) 

Note that the last inequality can also be derived directly from the non-positivity of 
the right hand side of eq. fl69l) . This means (similar to the usual interpretation of eq. 
(J2])) that although a is non-negative on the average, as the second law asserts, there is 
enough probabilistic weight to paths {x} for which a < 0, so that the last inequality 
must hold. 

A slightly more general case arises for ^ = e~ acr (a being an arbitrary real 
parameter), which yields 

{o-e- a °) > (r-'" T )log(f-' , ' T )M n (*<-"") . (71) 

or equivalently, 

> (e-)log(e-) o<1 
' 1 — a 

«-) < <£T!>J2i<^!) „ > ! . (72) 

1 1 — a 

Once again, the ordinary second law is obtained by substituting a = in the first of 
these two inequalities, which pertains to the case a < 1. 

Yet another interesting choice is *S?(x) = I{x : o~(x) < a}, where X{-} is the 
indicator function of the event in the braces and a is an arbitrary real parameter. In 
this case, our inequality tells us something about the conditional expectation of a given 
that o~(x) < a: 

(a) > logPr{a < a}, (73) 

where the ordinary second law is recovered as a — » oo. It may be interesting to look at 
the last inequality as an upper bounded on Pr{cr < a}: 

Pr{<7<a}<exp{(<7)| ff < tt }, (74) 

which is interesting when (a) \ a < a < 0. For a < 0, this is certainly the case. Note also 
that this is tighter than a straightforward use of the Chernoff bound [THUS], according 
to 

Pr{a <a}< (e^) = e a (75) 
because (a) | CT < Q cannot exceed a. 



5. Conclusion 



In this paper, we have derived a series of bounds related to properties of the 
entropy production that stem from fluctuation theorems. These bounds illustrate that 
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fluctuation theorems restrict the form of the probability distribution in a significant 
manner only for small mean entropy changes, but it is possible to improve these bounds 
upon incorporating additional information. Furthermore, we derived rigorous lower and 
upper bounds on the probability of the rare event where the entropy production is 
negative. The bounds depend on the mean entropy production and its variance, two 
quantities which are more readily accessible to measurement than the probability of this 
rare event. Finally, we presented a systematic way to derive inequalities which result 
from fluctuation theorems. These go beyond the standard derivation of the second law. 
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